Automated judgment of document qualities

Authors

  • Kwong Bor Ng
  • Paul B. Kantor
  • Tomek Strzalkowski
  • Nina Wacholder
  • Rong Tang
  • Bing Bai
  • Robert Rittman
  • Peng Song
  • Ying Sun

Abstract

the assessment of document qualities such as depth and objectivity. The primary purpose is to develop a quality-sensitive functionality, orthogonal to relevance, to select documents for an interactive question-answering system. The study consisted of two stages. In the classifier construction stage, nine document qualities deemed important by information professionals were identified and classifiers were developed to predict their values. In the confirmative evaluation stage, the performance of the developed methods was checked using a different document collection. The quality prediction methods worked well in the second stage. The results strongly suggest that the best way to predict document qualities automatically is to construct classifiers on a person-by-person basis.
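
The person-by-person idea in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the two numeric features, the judge names, the toy judgments, and the nearest-centroid classifier. The abstract does not say which features or learning method the authors used.

```python
from collections import defaultdict

def train_centroids(examples):
    """Average the feature vectors of each label: {label: centroid}."""
    sums = {}
    counts = defaultdict(int)
    for features, label in examples:
        if label not in sums:
            sums[label] = [0.0] * len(features)
        sums[label] = [s + f for s, f in zip(sums[label], features)]
        counts[label] += 1
    return {label: [s / counts[label] for s in total]
            for label, total in sums.items()}

def predict(centroids, features):
    """Return the label whose centroid is closest (squared Euclidean)."""
    def sq_dist(centroid):
        return sum((a - b) ** 2 for a, b in zip(centroid, features))
    return min(centroids, key=lambda label: sq_dist(centroids[label]))

def train_per_judge(judgments):
    """Group judgments by judge and fit one classifier per judge."""
    by_judge = defaultdict(list)
    for judge, features, label in judgments:
        by_judge[judge].append((features, label))
    return {judge: train_centroids(ex) for judge, ex in by_judge.items()}

# Hypothetical judgments of depth: (judge, (feature_1, feature_2), label).
# The two judges draw the line between "deep" and "shallow" differently.
judgments = [
    ("judge_a", (0.05, 0.30), "deep"),
    ("judge_a", (0.06, 0.28), "deep"),
    ("judge_a", (0.10, 0.20), "shallow"),
    ("judge_a", (0.15, 0.12), "shallow"),
    ("judge_b", (0.05, 0.30), "deep"),
    ("judge_b", (0.10, 0.20), "deep"),
    ("judge_b", (0.15, 0.12), "shallow"),
    ("judge_b", (0.16, 0.10), "shallow"),
]

models = train_per_judge(judgments)
new_doc = (0.11, 0.19)  # hypothetical feature vector of an unseen document
predictions = {judge: predict(model, new_doc) for judge, model in models.items()}
print(predictions)
```

In this toy setup the same unseen document receives a different predicted label from each judge's model, which is the kind of disagreement a single pooled classifier would average away.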

Similar resources

The Institutional Dimension of Document Quality Judgments

In addition to relevance, there are other factors that contribute to the utility of a document. For example, content properties like depth of analysis and multiplicity of viewpoints, and presentational properties like readability and verbosity, all affect the usefulness of a document. These kinds of relevance-independent properties are difficult to determine, as their estimations are more...

Adjectives as Indicators of Subjectivity in Documents

The goal of this research is to automatically predict human judgments of document qualities such as subjectivity, verbosity and depth. In this paper, we explore the behavior of adjectives as indicators of subjectivity in documents. Specifically, we test whether a subset of automatically derived subjective adjectives (Wiebe, 2000b), selected a priori, behaves differently than other adjectives. 3...

Measuring the quality of multi-document cluster headlines

Headline summaries of multi-document clusters enable efficient navigation and selection of content, provided headlines are of sufficient quality. This study compares several methods for automated headline extraction, with considerable variation in length. The reliability of the automated evaluation is validated by a comparison with human produced headlines, taking into consideration the variabi...

What Qualities Do Users Prefer in Diversity Rankings?

Novelty and diversity ranking aims to provide individual users or groups of users with the documents that will cover a space of information needs or different aspects of a single information need [7, 11]. Most approaches to diversity evaluation require a list of subtopics that either disambiguate a short query or give further specification of aspects of the underlying information need. Document...

Automated classification of pulmonary nodules through a retrospective analysis of conventional CT and two-phase PET images in patients undergoing biopsy

Objective(s): Positron emission tomography/computed tomography (PET/CT) examination is commonly used for the evaluation of pulmonary nodules since it provides both anatomical and functional information. However, given the dependence of this evaluation on physician’s subjective judgment, the results could be variable. The purpose of this study was to develop an automated scheme for the classific...

Journal title:
  • JASIST

Volume 57

Publication date 2006